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We show that heterogeneous degree distributions in observed scale-free topologies of complex 
networks can emerge as a consequence of the exponential expansion of hidden hyperbolic space. 
Fermi-Dirac statistics provides a physical interpretation of hyperbolic distances as energies of links. 
The hidden space curvature affects the heterogeneity of the degree distribution, while clustering is 
a function of temperature. We embed the Internet into the hyperbolic plane, and find a remarkable 
congruency between the embedding and our hyperbolic model. Besides proving our model realistic, 
this embedding may be used for routing with only local information, which holds significant promise 
for improving the performance of Internet routing. 
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Many complex networks possess heterogeneous degree 
distributions. This heterogeneity is often modeled by 
power laws, often truncated |T|. These networks also 
exhibit strong clustering, i.e., high concentration of tri- 
angular subgraphs. Our previous work [2] demonstrated 
that the clustering peculiarities of complex networks, and 
in particular their self-similarity, finds a natural geomet- 
ric explanation in the existence of hidden metric spaces 
underlying the network and abstracting the intrinsic sim- 
ilarities between its nodes. Here we seek to provide a 
geometric interpretation of the first property — network 
heterogeneity. We show that heterogeneous, or scale-free, 
degree distributions in complex networks appear as a sim- 
ple consequence of negative curvature of hidden spaces. 
That is, we argue that these spaces are hyperbolic. 

The main metric property of hyperbolic geometry is 
the exponential expansion of space, see Fig. [I] left. 
For example, in the hyperbolic plane, i.e., the two- 
dimensional space of constant curvature —1, the length 
of a circle and the area of a disc of radius R are 2w sinh R 
and 27r(coshi? — 1), both growing as ~ e R . The hyper- 
bolic plane is thus metrically equivalent to an e-ary tree, 
i.e., a tree with the average branching factor equal to 
e. Indeed, in a 6-ary tree the surface of a sphere or the 
volume of a ball of radius R, measured as the number of 
nodes lying at or within R hops from the root, grow as b R . 
Informally, hyperbolic spaces can therefore be thought of 
as "continuous versions" of trees. 

To see why this exponential expansion of hidden space 
is intrinsic to complex networks, observe that their topol- 
ogy represents the structure of connections or interac- 
tions among distinguishable, heterogeneous elements ab- 
stracted as nodes. This heterogeneity implies that nodes 
can be somehow classified, however broadly, into a tax- 
onomy, i.e., nodes can be split into large groups consist- 
ing of smaller subgroups, which in turn consist of even 
smaller subsubgroups. The relationships between such 
groups and subgroups can be approximated by tree-like 
structures, sometimes called dendrograms, in which the 




FIG. 1: Left: Artistic visualization of the Poincare disc 
model of the hyperbolic plane H 2 by Levy, based on Escher's 
Circle Limit III, with the permission from the Geometry Cen- 
ter, University of Minnesota. The exponential expansion of 
fish illustrates the exponential expansion of hyperbolic space. 
All fish are of the same hyperbolic size, but their Euclidean 
size exponentially decreases, while their number exponen- 
tially increases with the distance from the origin. Right: 
A modeled network with N = 740 nodes, power-law expo- 
nent 7 = 2.2, and average degree k « 5 embedded in the 
hyperbolic disc of curvature K — —1 and radius R « 15.5. 
The Euclidean distance between a node and the origin at the 
disc center, shown as the cross, represents the true hyper- 
bolic distance between the two. But the Euclidean distance 
between any two other nodes is not equal to the hyperbolic 
distance between them, as indicated by the peculiar shape of 
the shaded hyperbolic disc centered at the circled node lo- 
cated at distance r = 10.6 from the origin. The hyperbolic 
radius of this disc is also R, and according to the model, 
the circled node is connected to all the nodes lying in this 
disc. The curves show the hyperbolically straight lines, i.e., 
geodesies, connecting the circled node and some nodes in its 
disc. 



distance between two nodes estimates how similar they 
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are [3]. Importantly, the node classification hierarchy 
need not be strictly a tree. Approximate "tree-ness," 
which can be formally expressed solely in terms of the 
metric structure of a space jl], makes the space hyper- 
bolic. 

Let us see what network topologies emerge in the sim- 
plest possible settings involving hidden hyperbolic metric 
spaces. Let us form a network of N S> 1 nodes located 
in the hyperbolic plane H 2 . Since the number of nodes is 
finite, the area that nodes occupy is bounded. Let R ^> 1 
be the radius of a disc within which nodes are uniformly 
distributed. In hyperbolic geometry, this means that 
nodes are given an angular coordinate randomly dis- 
tributed in [0, 2n), and a radial coordinate r following the 
density p(r) = sinh r/ (cosh R— 1) w e r ~ R . Next, we have 
to specify the connection probability p{x) that two nodes 
at hyperbolic distance x are connected. We first consider 
the simplest case, the step function p(x) — Q(R — x), 
and justify this choice later. This p{x) connects each 
pair of nodes if the hyperbolic distance between them is 
not larger than R. 

The network is now formed, and we can compute the 
average degree k(r) of nodes at distance r from the disc 
center. These nodes are connected to all nodes in the 
intersection area of the two discs of the same radius R, 
one in which all nodes reside, and the other centered at 
distance r from the center of the first disc, see Fig. [T] 
right. Since the node distribution is uniform, k(r) is pro- 
portional to the area of this intersection, which decreases 
exponentially with r, k(r) ~ e~ r / 2 . Therefore, the in- 
verse function is logarithmic, f{k) ~ — 21n/c, and the 
node degree distribution in the network is approximately 
a power law, P(k) « p[r(fc)] |r'(fc)| ~ fc~ 3 . If we general- 
ize the space curvature to K = —( 2 , C > 0, and the node 
density to p(r) ae a ^~~ R ', where we can think of a > 
as the logarithm of the average branching factor in the 
underlying hierarchy, then the average degree at radius 
r scales as fc(r) ~ e _< » r / 2 if a/£ ^ 1/2, or k(r) ~ e~ ar 
otherwise, so that the node degree distribution becomes 
P{k) ~ fc-f with 




if a/C> 1/2, 
otherwise. 
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To fix the average degree in the network, we have to 
choose N — ce? R , where c is a constant. The result 
in Eq. ([I]) is remarkable as it shows that heterogeneous 
degree distributions may emerge as a simple consequence 
of the exponential expansion of hyperbolic space. 

However, our choice of the step-function connection 
probability is not yet justified. To justify it, and to show 
that scale-free networks have effective hyperbolic geome- 
tries underneath, we recall the S 1 model introduced in |2J. 
In that model, networks are constructed as follows. First, 
distribute N nodes uniformly over the circle S 1 of ra- 
dius N/(2tt), so that the node density on the circle is 
fixed to 1. Second, assign to all nodes an additional 
hidden variable k representing their expected degrees. 



To generate scale-free networks, the variable k is power- 
law distributed according to p(n) — Kq _1 (7 — 1)k -7 , 
k 6 [ko,oo), where k is the minimum expected degree. 
Finally, let k and k! be the expected degrees of two nodes 
located at distance d = NAO/ (2ir) measured over the cir- 
cle (A8 is the angular distance between the nodes). We 
connect each pair of nodes with probability p(x) i where 
X = d/(p,KK r ), and constant p fixes the average degree in 
the network. 

The key point is that the connection probability p{x) 
can be any integrable junction. As long as the distance 
over the circle is rescaled as x ~ cI/(kk'), any integrable 
p(x) guarantees that the expected degree of nodes with 
hidden variable k is indeed k, k(n) = k, so that 7, which 
is a model parameter, is indeed the exponent of the de- 
gree distribution in generated networks. 

We now want to map the expected degree k of each 
node to a radial position r within a disk of radius R, such 
that after the mapping, the radial distribution of nodes 
is p(r) w ae a ( r ~ R \ i.e., as in the hyperbolic H 2 model 
introduced above. To have this p(r), we must select the 
k — ► t mapping according to 
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where £ is fixed by the values of 7 and target a. We see 
that k(t) and consequently k(r) scale with r as in the H 2 
model, while the connection probability p{x) becomes 

p (ez( x ~ R ^J , where the effective distance 
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is approximately equal to the hyperbolic distance be- 
tween the two nodes in the disk. Indeed, the true hy- 
perbolic distance x between two points with polar co- 
ordinates (r, 6) and (r',9 1 ) in the hyperbolic space H 2 
of curvature K — — £ 2 is cosh (x = cosh £r cosh £r' — 
sinh sinh £r' cos AO, which for sufficiently large £r, £r', 
and AO > 2\J e~ 2 ^ r + e~ 2 ^ r ' is closely approximated by 
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Since the effective and true hyperbolic distances in 
Eqs. (|3|4| are approximately equal, the value of £ in 
Eq. (I2J is indeed the square root of curvature of the hy- 
perbolic disc, in agreement with Eq. |l]) in the H 2 model. 
We also notice that since the connection probability p(x) 
in the S 1 model can be any integrable function, the con- 
nection probability p{x) in the H 2 model can be any func- 
tion of the form p(x) = p ^e§( a: ~ i? )^ . 

Given this freedom of choice of the connection proba- 
bility, let us consider the family of functions 
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parameterized by T > 0. One motivation to focus on this 
family is that it generates exponential random graphs in 
the statistical mechanics sense [5]. Eq. (5]) is nothing but 
the grand canonical Fermi-Dirac distribution, and T is 
the system temperature. From the physical perspective, 
graph edges are non-interacting fermions with energies 
equal to their hidden hyperbolic lengths, and R is the 
chemical potential defined by the condition that kN/2, 
the number of edges- fermions, is fixed on average. At 
T — > Eq. ([5| converges to p(x) — Q(R — x), which a 
posteriori justifies our choice of the step function connec- 
tion probability in the H 2 model. 

The dependence on temperature in the model is pecu- 
liar. At zero temperature, the network is in the strongly 
degenerate ground state. As we heat it up, particles ex- 
plore higher-energy states, i.e., edges connect longer dis- 
tances, which affects clustering. At T — > 0, clustering is 
maximized. It monotonically decreases with T, and at 
T — > 1 we have a phase transition with clustering go- 
ing to zero, and the network losing its cold-state metric 
structure. In the cold regime with T < 1, the exponent of 
the degree distribution 7 depends only on the ratio a/Q 
via Eq. (JT|). Therefore, we can set a = 1/2 without loss 
of generality, so that 7 = l/£ + 1 is fully defined by cur- 
vature K > — 1. In the hot regime with T > 1, clustering 
remains zero, the chemical potential is no longer given 
by N — cez R but by N — cew R , and 7 also depends 
on temperature, 7 = T/£ + 1. Therefore at T — > 00 the 
graph ensemble is identical to classical random graphs, 
as all fermions are uniformly distributed across all ener- 
gies, i.e., all pairs of nodes are connected with the same 
probability independent of the hidden distance between 
them, and the network loses its cold-state hierarchical 
structure. Combining the cold and hot regimes, 
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T/C + 1 if T > 1 and ( < T, 
2 otherwise. 
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Finally, constant c fixing the average degree in the net- 
work is 
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The H 2 model can thus generate classical random graphs 
and scale- free networks with any average degree, power- 
law exponent 7 > 2, and clustering. In Fig. [2] left, we 
see that the curvature and temperature of the Internet 
are approximately K = —0.83 and T = 0.6 ± 0.1. 

Eq. d2J) establishes a formal equivalence between the S 1 
and Ermodels we introduced in |2] and here. The two 
models generate similar network topologies thanks to the 
similarity between the effective and true hyperbolic dis- 
tances in Eqs. ( 3|4 > . However, if we are to study other 
geometric properties of these networks, such as their nav- 
igability [7J, then it does matter a lot what distances, 
spherical d native to S 1 or hyperbolic x native to H 2 , 
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FIG. 2: Networks in the H 2 model vs. the Internet. Left: The 
degree distribution P(k) and degree-dependent clustering co- 
efficient c(k) are shown for the skitter (k = 6.29, C = 0.46) 
and Border Gateway Protocol (BGP) (k = 4.68, C = 0.29) 
views of the Internet from |^ , and for modeled networks with 
curvature K — —0.83 and two values of temperature T, 0.47 
(k = 6.03, C = 0.44) and 0.71 (k = 4.85, C = 0.25). Right: 
The empirical connection probability in the hyperbolically 
embedded Internet, compared to Eq. ([5j . 



we use to navigate a network. The latter distances x 
are dominated by r + r' , minus some small f5-dependent 
corrections. This effect can be observed in Fig. [T] right, 
where we show some hyperbolic geodesies between nodes 
in a small modeled network. These geodesies follow 
closely the radial directions between the nodes and the 
origin, i.e., they follow the same pattern as the shortest 
paths in the embedded network. Spherical distances d 
are at the other extreme, as their gradient lines lie in the 
orthogonal tangential directions. 

To demonstrate how such differences in distance cal- 
culations affect the efficiency of transport processes on 
networks, we embed the real Internet topology from [5] 
into H 2 using maximum-likelihood techniques. Specif- 
ically, we first assign to nodes random angular coordi- 
nates, while their radial coordinates are fixed by Eq pi. 
We then execute the Metropolis-Hastings algorithm ^9 
by moving random nodes to new locations with the same 
radial coordinate but with a randomly chosen new an- 
gular coordinate. We accept each move with probability 
min(l, L a / Lb), where Lb and L a are the likelihoods, be- 
fore and after the move, that the network is produced by 
our H 2 model with parameters matching the Internet in 
Fig.g left. Formally, L = Ui^PM^il-pixij)] 1 -^, 
where {<%} is the Internet adjacency matrix, and Xij is 
the hyperbolic distance between nodes i and j. 

After this process has converged, we perform greedy 
routing as in 71 in the resulting embedding. We ran- 
domly select a source, and try to find a path to a random 
destination by selecting the next node on the path as the 
current node's neighbor closest to the destination in H 2 . 
This process can be unsuccessful, as it can get stuck at 
intermediate nodes that have no neighbors closer to the 
destination than themselves. The percentage of success- 



4 



fill greedy paths and their hop-length averaged over 10 5 
random source-destination pairs are 94.5% and 3.95 (the 
average length of shortest paths is 3.46). For compari- 
son, the same numbers using the S 1 distances are 75.9% 
and 4.29. The reason for the exceptionally high ratio of 
successful paths in the H 2 case is that the shortest paths 
in the Internet stay close to the hyperbolic geodesies, fol- 
lowed by greedy navigation, between the corresponding 
source and destination. In other words, the real Internet 
topology is remarkably congruent with underlying hyper- 
bolic geometry. 

Even more striking in this regard is Fig. [2] right, where 
we show the empirical connection probability for the links 
vs. their hyperbolic distances in the embedded Internet, 
juxtaposed with the theoretical connection probability 
in our H 2 model. The similarity between the two pro- 
vides empirical evidence that our model reflects reality. 
If the real Internet were not congruent with our hyper- 
bolic model, then no maximum-likelihood technique used 
for its embedding would be able to make it such. 

In summary we have shown that complex network 
topologies are congruent with hyperbolic geometries. We 
can start with hyperbolic geometry, and show that it nat- 
urally gives rise to scale-free topology, or we can start 
with the latter, and show that hyperbolic geometry is 
its effective geometry. In this geometric approach, clus- 
tering and heterogeneous degree distributions appear as 
simple consequences of the metric and negative-curvature 
properties of hyperbolic spaces. In our hyperbolic model, 
the space curvature controls the heterogeneity of the de- 
gree distribution, while clustering is a function of tem- 
perature. Fermi-Dirac statistics provides a physical in- 
terpretation of hidden distances as energies of the corre- 
sponding links- fcrmions. This analogy may contribute to 
applications of the standard tools of statistical mechan- 
ics to the analysis of complex networks [5], which can 
be informally thought of as negatively curved containers 
of ultracold fermions. The Internet embedding, besides 
providing empirical evidence that our model is realistic, 



shows that the efficiency of transport processes without 
global knowledge is maximized if such processes use (ef- 
fective) hyperbolic distances. If networks evolve to be 
efficient with respect to their functions — and transport 
is one of such functions, — then this finding further sup- 
ports our hyperbolic metric space approach. 

The Internet embedding may also prove practically 
useful, since routing in it is extremely efficient and re- 
quires only local information about hyperbolic coordi- 
nates of node neighbors. Global knowledge of the large- 
scale Internet topology is a major scalability bottleneck 
in Internet routing today [10]. Another potential appli- 
cation of our work is protein folding, where hidden spaces 
are protein conformation energy profiles, and the protein 
folding process is greedy routing toward the minimum- 
energy conformation [11]. Yet another class of applica- 
tions involves cases where to have a right model for sim- 
ilarity distances is a key, such as recommender systems 
used by companies such as Amazon or Netflix. Their 
efficiency depends on how accurately the similarities be- 
tween consumers are estimated. Our hyperbolic explana- 
tion of the structure of complex networks is by no means 
the only possible mechanism capable of generating scale- 
free topologies with strong clustering. Therefore, the 
question of special interest is whether our explanation is 
(implicitly) equivalent to existing models, among which 
preferential attachment [H] appears to be most popular? 



Acknowledgments 

We thank A. Goltsev, S. Dorogovtsev, A. Samukhin, 
R. Pastor-Satorras, A. Baronchclli, M. Newman, 
J. Klcinberg, Z. Toroczkai, F. Menczer, A. Clauset, 
D. Clark, K. Fall, kc claffy, and others for useful dis- 
cussions and suggestions. This work was supported by 
NSF CNS-0434996, CNS-0722070, DHS N66001-08-C- 
2029, Cisco Systems, and FIS2007-66485-C02-02. 



[1] A. Clauset, C. R. Shalizi, and M. E. J. Newman, SIAM 

Rev (to appear) (2009). 
[2] M. A. Serrano, D. Krioukov, and M. Boguna, Phys Rev 

Lett 100, 078701 (2008). 
[3] D. J. Watts, P. S. Dodds, and M. E. J. Newman, Science 

296, 1302 (2002); A. Clauset, C. Moore, and M. E. J. 

Newman, Nature 453, 98 (2008). 
[4] M. Gromov, Metric Structures for Riemannian and Non- 

Riemannian Spaces (Birkhauser, Boston, 2007). 
[5] J. Park and M. E. J. Newman, Phys Rev E 70, 066117 

(2004); S. N. Dorogovtsev, J. F. F. Mendes, and A. N. 

Samukhin, Nucl Phys B 666, 396 (2003). 
[6] P. Mahadevan, D. Krioukov, M. Fomenkov, B. Huffaker, 

X. Dimitropoulos, kc claffy, and A. Vahdat, Comput 

Commun Rev 36, 17 (2006). 
[7] M. Boguna, D. Krioukov, and kc claffy, Nature Physics 

5, 74 (2009). 



[8] Y. Shavitt and E. Shir, Comput Commun Rev 35 (2005). 
[9] M. E. J. Newman and G. T. Barkema, Monte Carlo 

Methods in Statistical Physics (Clarendon Press, Oxford, 

1999). 

[10] D. Meyer, L. Zhang, and K. Fall, eds., RFC4984 (The 

Internet Architecture Board, 2007). 
[11] E. Ravasz, S. Gnanakaran, and Z. Toroczkai, arXiv: 
I 0705.0912| (2007). 

[12] A.-L. Barabasi and R. Albert, Science 286, 509 (1999). 



